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ON A STATISTIC ARISING IN TESTING CORRELATION 



Walter Kristof 
Summary 

This paper is devoted to the study of a certain statistic, u ; defined 
on samples from a bivariate population with variances ay ^ ; cfgg and cor- 
relation p . Let the parameter corresponding to u be y . Under 
binormality assumptions the following is demonstrated. (i) If cr.^ = oy^ , 
then the distribution of u can be obtained rapidly from the F distribu- 
tion. Statistical inferences about p = u may be based on F . (ii) In 
the general case, allowing for cr^ f ®22. y a cer ^ a ^ n Quantity involving u ; 
r (sample correlation between the variables) and u follows a t distribu- 
tion. Statistical inferences about u may be based on t . (iii) In the 
general case a quantity t T may be constructed which involves only the 
statistic u and only the parameter u . If treated like a t distributed 
magnitude^ t T admits conservative statistical inferences. (iv) The F 
distributed quantity mentioned in (i) is equivalent to a certain t distrib- 
uted quantity as follows from an appropriate transformation of the variable. 

(v) Three test statistics are given which can be utilized in making statistical 
inferences about p = u in the case cr^ = oy^ . A comparison of expected 
lengths of confidence intervals for p obtained from the three test statistics 
is made. (vi) The use of the formulas derived is illustrated by means of an 
application to coefficient alpha. 
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ON A STATISTIC ARISING IN TESTING CORRELATION 1 
1. Introduction 

A paper by Mehta and Gurland (1969) motivated the present study. 

These authors derived the distribution of a certain statistic, u , defined 
on samples from a bivariate normal population. They utilized this statistic 
in estimating the difference of the means of two binormally distributed 
variables when some of the observations corresponding to one of the variables 
are missing. 

Let , X^ "be binormally distributed variables with variance - 

covariance matrix Z - o\ . . Let S = s. .1 be the matrix of sample 

ij - 1 3 

second moments. The statistic u is defined as 

h 2Sl2 

(i; u = 

S X1 + s 22 

The corresponding parameter is 
2 °12 

(2) u = ^ 

' ' 4. rr 

"11 + °22 

2 2 

Evidently, 0 < V .<1 and 0 < u < 1 . Let p be the correlation between 
X^ and Xg • If cr^ = cr^ , then p = u and p = u is the maximum- 
likelihood estimator of p 0 We will assume that u f 1 , i.e., Xg ^ + X^ 

+ const. 


\ 


^This research was supported by the National Institute of Child Health 
and Human Development, under Research Grant 1 P01 HDOI762. 
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The main purpose of this paper consists in deriving means for making 
statistical inferences about u under the conditions c = cfpp and 
<r ^ f crpp . An application of the results to coefficient alpha will be 
appended by way of illustration. A more detailed description is contained 
in the summary. 



2. The Case cr ^ = cr^p 



Set cx = oy >2 = cr and make a transformation of variables : 



( 3 ) 



Y 2 = X x + X a . 

The new variables are again binormally distributed with variance -covariance 

matrix £* = ||cr*.|| , say. Explicitly, 

3 



(k) o^ 1 = 2a (1 - p) 



cr* 2 = 2cr (1 + p) 



°12 = ° 



Let S* = ||st^|| be the matrix of new sample second moments. Evidently, 
(5) s* x = s xl + s 22 - 2s l2 = ( Sll + s 22 )(1 - u) 



s 22 = S 11 + s 22 + 2s 12 = Wl + s 22^ + u ) * 



We have cr* n /o* p = (l - p)/(l + p) and s pp/ s 22 = ( 1 " u )/( 1 + u ) • Hence 



11 ' 22 

the quantity 
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(6) 



1 - p _ 1 + u 

1 + p 1 - u 



follows an F distribution with df = dfg = N - 1 , N indicating sample 
size. It would be a simple matter to obtain the distribution of u explicitly 
from the F distribution by making substitution (6) and using the differential 



(7) 



dF 



1 - p 2du 

1+ 0 ' (1 - u ) 2 



This derivation is more elementary and speedier than approaches starting from 

the Wishart distribution of sample second moments s. . . 

-LJ 

The F test provides a familiar means of testing the homogeneity of two 

random independent sample variances under normality conditions. A hypothesis 

H Q : u = p = p Q is equivalent to homogeneity of ~ Pq) and - s 22^^~ + ; 

2 

the common parameter being 2cr . It follows from ( 5 ) that quantity (6) can be 
employed in testing Hq . Of course, confidence intervals for p may also be 
established by using F . 

Rao (1952, p. 226) suggests the likelihood ratio criterion, L , for test- 
ing homogeneity of variances when any knowledge as to the possible relation- 
ship between the two population variances is missing. In the present context, 
this amounts to the absence of nontrivial knowledge concerning bounds for p . 

One obtains 



( 8 ) 



.N-l _ A 1 ~ p 0 )(l ~ U l 
! - p Q u 



For large samples, the quantity -2&nL is distributed as x with df = 1 
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3. The General Case 

We admit no w o\^ f cr^ . The derivations of the previous section do 
not generalize to the present case. Instead we will adapt a recent develop- 
ment of the sampling theory of reliability estimation by Kristof (1970). 

Using 0 as defined in (2), we make the following transformation of 
variables : 



( 9 ) Y 1 = (/1 - u + /TT~y j)x 1 + (/l 



/i + 



y 2 = (/1 -~ u - yn~v)x 1 + (y 1 - 0 + yi + u)x 



The determinant of the transformation matrix is 4,/l - . This value is 



different from zero since u ^ 1 . The new variables ; Y 0 are 



again 



binormally distributed with variance -covariance matrix ||cr¥*.|| , say. 

Explicitly^ 



(10) o* 1 = 2(1 + Jl - o 2 )^ + 2(1 - /l - u 2 )cr 22 - 4ucr 



12 



°2 2 = 2 ^ 1 " J 1 ~ A)cr + 2 ( 1 + A " w 2 ) 



°22 " ^ U °12 



°2.2 — 2 ^ 2cr lP " u ( or i i °"pp ) J — 0 by (2). 
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Hence the correlation between variables Y and Yg is zero. 

Let S* = ||s*.|| be the matrix of new sample second moments. The quanti- 
1 J 

ties sf are obtained from the original sample second moments s. . by sub- 

1 J 

stitution of and s.. for cr*. and or.. ; respectively; in equations 

-LJ ^*0 

(10). 

2 

Let us consider a hypothesis H^: u = with Uq f 1 . Validity of 

Hq is equivalent to Y^ and Yg being uncorrelated when Uq is used in (9). 
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The t test provides a familiar means of testing a zero correlation between 
binormally distributed variables. Under H the quantity 



follows a t distribution with df = N - 2 . In terms yf the original 
sample second moments ; 



The derivation of expressions (10) does not depend on distributional 
assumptions. Binormality was used in giving (ll) and its equivalent (12). 
However^ the permutation distribution of a sample correlation coefficient 
virtually coincides with its normal-theory distribution when the population 
correlation coefficient is zero (Gayen, 1951)- One may therefore expect 
that the proposed t test (12) Tt will be reasonably powerful for a wide 
range of alternatives approximating normality" and, in the absence of bi- 
normality; remain "in fact very accurate even for small n [sample size]" 
(Kendall 8c Stuart; 1961; pp. V[5-^76). 

Of course; the construction of confidence intervals for o may also 
be based on (12) when u and r are given. 

Application of formula (12) requires knowledge of both u and r . 

Let us consider the situation when just u is given and inferences concern- 



(n) t 




yN - 2 




u - u 



0 



(12) t 



Ui/1 - u 





ing u are to be made. 
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Suppose we wish to test a hypothesis 11^: u - with / 1 

Again validity of li^ is equivalent to Y n and Y 0 being uncorrelated 



when Uq is used in (9); P* 



1 ‘2 

In analogy to the definition of u we 



introduce under H the new statistic 



(13) 



Pc;* 

S l2 



s * 4. s ^ 
S 11 22 



u - u 



1 - U\J, 



Now we have reduced the problem to the following. We are to test a hypoth- 
esis H*: p* ■■ 0 on the basis of the "observed" value u* . Unfortunately, 

the distribution of u* under H* depends on the unknown ratio cr*^/<T*g 
as may be seen from formulas (l8) and (19) in Mehta and Gurland (1969). 

Let us determine the set of all possible sample correlation coefficients 
r* that are compatible with a given value u* when s q]y s 22 ® ^ 

coefficient r* is compatible with a given u* precisely when there are 

numbers a,b > 0 and c such that u* = 2c/(a + b) , r* ~ c/>/ab and 
2 

ab - c > 0 . It is a rather simple matter to show that (i) u* - 0 is 
equivalent to r* = 0 , (ii) for u* f 0 given, all compatible r* have 
the sign of u* and satisfy precisely |u*| < |r*| <1 . Hence the 
absolutely smallest r* compatible with a given u* is u* itself. 

It follows that the quantity 



u* 



/ 



1 - u* 



y n - 2 



u 



u 



0 




yw - 2 



(lU) t’ 
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may be used for testing u - Uq when it is treated like a t distributed 

magnitude with df - N - 2 . This procedure will be conservative in the sense 
that it will be harder to reject 1 -Iq than when both u and |r| >0 are 
given and (12) is invoked. We will always have |t*j < |t| . The relation 

between t and t 1 is found to be 

/ncN ^ ^ s ll ■ S 22 ^ 

( 15 ) t=t'/l+ g — 

J 4 ( S 11 S 22 - S 12^ 

The radicand depends on both the difference s - s^g and th e determinant 

Is | . 

4. Further Discussion of the Case o\^ = cr^ 

In the psychometric literature the division of a test into two parallel 
parts (cr-^ ~ cr 22 ; correlation p between parts) has received particular 
interest presumably because the reliability p, of the total test is then a 

"G 

simple monotonic function of p alone, namely, p = 2p(l + p) ^ . Statistical 

"G 

inferences about p^_ may be based upon statistical inferences concerning p . 

The assumption o\^ = u 22. eY ^ en ^y equivalent to u = p and, as 
follows from (10), also to cr*^ = cr^g . The results of sections 2 and 5 
provide us now with at least three different formulas that may be used when 
inferences about p in the case <y^ = cr^ are sought. These are: 



(6) 



1 - p . 1 + u 
1 + p 1 - u 



df = dfg = N - 1 








0 



( 12 - ) 



t 



df = N - 2 



